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(54) Synchronizing apparatus for a compressed audio/video signal receiver 



(57) A compressed audio/video receiver (200, 202, 
206), includes circuitry (217-220 222) to measure the 
relative synchronization of decompressed audio and 
video signals. If audio and video signal synchronization 
is within a first range of values, synchronization circuitry 
(215, 216) within the receiver will attempt to appropri- 



ately time align the signals. Alternatively, if actual syn- 
chronization exceeds the first range, attempts to 
synchronize the signals are suspended and non-syn- 
chronized decompressed audio and video signals are 
provided. 
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Description 

[0001] This invention relates to a method and appa- 
ratus for synchronization of audio and/or video compo- 
nents in a compressed audio/video (A/V) signal 5 
receiver. 

[0002] MPEG is a compressed video signal proto- 
col established by the Moving Pictures Experts Group of 
the International Standardization Organization. This 
protocol defines a versatile signal format which includes w 
both intraframe coding and motion compensated pre- 
dictive coding. Due to variations in coding format frame 
to frame, and variations in image content, different 
frames have widely divergent quantities of compressed 
data. As a consequence of different frames being com- 15 
pressed with different quantities of data etc., frames of 
data tend to be transmitted asynchronously. 
[0003] Audio signals may also be compressed 
according to an MPEG protocol. Compressed audio 
may be associated with video but transmitted independ- 20 
ently. For transmission the compressed audio is seg- 
mented into packets which are then time division 
multiplexed with the compressed video signal in a non 
synchronous manner. 

[0004] Associated compressed audio and video 25 
components are not only independently asynchronous, 
but their mutual temporal relationships or synchronism 
is nonexistent in transmission. 

[0005] MPEG compressed audio and video compo- 
nent signals may include presentation time stamps 30 
(PTS) to establish a reference between particular com- 
pressed signal segments and a system reference clock 
signal. The audio and video PTS's are utilized by 
receiver apparatus to both resynchronize respective 
decompressed components and to restore their tempo- 35 
ral interrelationship. 

[0006] An MPEG or MPEG like A/V receiver will 
provide the A/V component timing reference signals 
(PTS's) coincident with reproduction of associated 
decompressed component signals. Synchronization 40 
apparatus within the receiver will use the PTS's occur- 
ring in the audio and video components to maintain the 
audio and video components in synchronism. While the 
system is being brought into synchronization, and the 
audio is not lip-synced with video, the audio is typically 45 
muted because it is perceived that viewers would rather 
hear no audio, than audio which is not synchronous with 
the video, at least for short intervals. Muting will gener- 
ally occur for only relatively short intervals because 
there are relatively stringent requirements regarding the 50 
timing of the transmitted audio and video components 
for a given standard. Thus synchronization is guaran- 
teed to occur in a short interval. 
[0007] In receivers designed to decode standard- 
ized A/V components a problem exists if the receiver 55 
receives a non standard signal, or a standard signal, 
which has been demultiplexed and remultiplexed in a 
plurality of virtual transmission channels. In both cases, 



the relative audio and video component timing relation- 
ships may be variable. Lip-sync may not, and muting 
may occur for long intervals. This situation is untenable. 
[0008] A second problem exists regarding the 
method of synchronization when applied to non stand- 
ard signals. The synchronization process typically 
includes a coarse mode and a fine mode. Audio data is 
normally decoded ahead of video data and stored in 
memory. The stored audio is reproduced from memory 
when the appropriate video is decoded. However, if the 
audio in the nonstandard signal arrives too late, there 
will be no audio ready to reproduce and synchronization 
cannot occur. On the other hand, if the audio occurs too 
early, (relative to buffer size) there will be insufficient 
memory to store the audio and data will be lost. Again 
synchronization cannot occur, possibly for long inter- 
vals. 

[0009] It will be appreciated that the viewer will pre- 
fer that both video and audio be reproduced in these 
instances, rather than suffer long periods of audio mut- 
ing. Thus, there is a need for an adaptive synchroniza- 
tion system in digital A/V receivers to accommodate 
both standard and non standard received signals. 
[0010] An A/V receiver according to the present 
invention provides A/V component timing reference sig- 
nals (PTS's) coincident with the reproduction of associ- 
ated decompressed component signals. 
Synchronization circuitry generates a function of the dif- 
ference of occurring component audio and video PTS's. 
This function is indicative of relative audio and video 
synchronization. If the value of the function is within a 
certain range, the synchronization process is continued. 
If the value of the function exceeds a predetermined 
level, the synchronization process is terminated and 
non synchronized audio and video components are 
reproduced. In a further embodiment, for a given range 
of values of the function, the synchronization process 
continues, but the audio component is muted. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0011] 

FIGURE 1 is a block diagram of audio/video com- 
pression apparatus. 

FIGURE 2 is a block diagram of a audio/video 
decompression apparatus embodying the present 
invention. 

FIGURE 3 is a block diagram of apparatus for pro- 
viding the receiver system clock signal having sub- 
stantially the same rate as the system clock of the 
compression apparatus. 

FIGURE 4 is a flow chart of the operation of the 
FIGURE 2 apparatus. 

FIGURE 5 is a block diagram of alternative muting 
circuitry which may be implemented in the FIGURE 
2 apparatus. 



2 



3 



EP 1 071 290 A2 



4 



[0012] FIGURE 1 illustrates a typical system in 
which the invention may be practiced, which system is a 
compressed digital video signal transmission arrange- 
ment. In this system, video signal from a source 10 is 
applied to a video signal compression element 1 1 which 5 
may include a motion compensated predictive encoder 
utilizing discrete cosine transforms. Compressed video 
signal from the element 1 1 is coupled to a formatter 1 2. 
The formatter arranges the compressed video signal 
and other ancillary data according to some signal proto- 10 
col such as MPEG, a standard developed by the Inter- 
national Organization for Standardization (Organization 
Internationale De Normalisation). The standardized sig- 
nal is applied to a transport processor 13, which divides 
the signal into packets of data and adds certain over- 75 
head to provide some noise immunity for the transmis- 
sion purposes. The transport packets, which normally 
occur at a non-uniform rate are applied to a rate buffer 
14 which provides output data at a relatively constant 
rate conducive to efficient use of a relatively narrow 2 o 
bandwidth transmission channel. The buffered data is 
coupled to a modem 1 5 which performs the signal trans- 
mission. 

[0013] A system clock 22 provides clocking signal 
to operate much of the apparatus, at least including the 25 
transport processor. This clock will operate at a fixed 
frequency such as 27 MHz for example. As shown 
herein, however, it is used to generate timing informa- 
tion. The system clock is coupled to the clock input of a 
counter 23 which may be arranged to count modulo 2 30 , 30 
for example. The count values output by the counter are 
applied to two latches 24 and 25. The latch 24 is condi- 
tioned by the video source to latch count values on the 
occurrence of respective frame intervals. These count 
values are denoted presentation time stamps, PTS's, 35 
and are included in the compressed video signal stream 
by the formatter 12, and are used by the receiver to pro- 
vide lip-synchronization of associated audio and video 
information. The latch 25 is conditioned by the transport 
processor 1 3 (or the system controller 21 ) to latch count 40 
values according to a predetermined schedule. These 
count values are denoted system clock references, 
SCR's, and are embedded as auxiliary data within 
respective auxiliary transport packets. 
[001 4] Audio signal associated with the video signal 45 
from source 1 0 is applied to an audio signal compressor 
1 8. The compressor 1 8 provides frame sampling pulses 
(independent of video frames) to control a latch 19. 
Responsive to the sampling pulses, the latch 19 cap- 
tures count values provided by the counter 23. These so 
latched values correspond to audio presentation time 
stamps PTS aud . The PTS aucj are incorporated in the 
compressed audio signal provided by the compressor 
18. The compressed audio signal is coupled to a trans- 
port processor 1 7 which divides the signal into packets 55 
of data and adds certain overhead to provide some 
noise immunity for transmission purposes. The audio 
transport packets provided by the processor 1 7 are cou- 



pled to a multiplexor 16 which time division multiplexes 
the audio and video transport packets. In the figure, 
separate transport processors are shown in the audio 
and video signal processing channels. For systems 
wherein the data rate is moderate, the functions of the 
two transport processors and the multiplexor 16 may be 
subsumed in a single transport processor. 
[0015] The system controller 21 is a variable state 
machine programmed to coordinate the various 
processing elements. Note that the controller 21, the 
compressors 11 and 18, the transport processors 13 
and 17, and the rate buffer 14 may or may not operate 
synchronously via a common clocking arrangement as 
long as proper handshaking is provided between 
processing elements. However, the two compressors 
both derive PTS values from the same reference coun- 
ter 23, thus a precise timing relationship between the 
two compressed signals is provided in the compressed 
output signal. 

[0016] FIGURE 2 illustrates an exemplary receiver 
apparatus embodying the invention wherein the modem 
200 performs the inverse function of the modem 15 and 
the rate buffer 206 performs, in effect, the inverse func- 
tion of the rate buffer 14. An inverse transport processor 
202, divides respective transport packets by service 
and allocates the respective packet payloads differing 
memory blocks in the rate buffer 206. In so doing the 
respective transport packet signal payloads are sepa- 
rated from the auxiliary data, with the auxiliary data 
applied to the system controller 210. In an alternative 
arrangement a separate transport processor may be 
included in each processing channel and arranged to 
recognize and process only data associated with the 
respective channels. 

[0017] Compressed video data from the rate buffer 
206, is passed to the video decompressor 214. The rate 
buffer 206 accepts compressed video data at a bursty or 
inconsistent rate, and provides data on demand to the 
decompressor 214. The decompressor, responsive to 
the compressed video signal, generates non-com- 
pressed video signal for display or for storage etc., in 
appropriate display or storage devices (not shown). 
[0018] Compressed audio data from the inverse 
transport processor 202 is applied to the rate buffer 206, 
which provides compressed audio signal according to 
the system protocol to the audio decompressor 212. 
The decompressor 212, responsive to the compressed 
audio signal generates non-compressed audio signal 
for reproduction or for storage etc., in appropriate 
speakers or storage devices (not shown). 
[0019] The inverse processor 202 also provides 
SCR's from the auxiliary transport data, and control sig- 
nals to a system clock generator 208. The clock gener- 
ator responsive to these signals generates a system 
clock signal synchronous with at least the transport 
processor operation. This system clock signal is applied 
to the receiver system controller 210 to control the tim- 
ing of appropriate processing elements. 
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[0020] Figure 3 illustrates details of an exemplary 
clock regenerator 208. Data from the receiver modem 
200 is coupled to the inverse transport processor 202', 
including an auxiliary packet detector 31 . The inverse 
transport processor 202' separates transport header 5 
data from the respective transport packet payloads. 
Responsive to the transport header data, the processor 
202' demultiplexes payloads of the desired associated 
audio and video program components and payloads of 
associated auxiliary data. The audio video and auxiliary w 
payloads are written to separate memory blocks of the 
rate buffer 208. The respective memory blocks are each 
operated as first-in -first-out memories or FIFO's, writing 
data when it is available from the modem and reading 
data when it is demanded by the corresponding compo- 75 
nent signal processor (not shown). SCR's present in 
particular auxiliary packets are routed and stored in a 
memory element, 34. 

[0021] An auxiliary packet detector 31, which may 
be a matched filter arranged to recognize the code 20 
words designating an auxiliary transport packet contain- 
ing a SCR, produces a control pulse on the occurrence 
of transport packets containing such data. The control 
pulse is utilized to capture and store within a latch 35, 
the count value currently exhibited by the local counter 25 
36 at a time precisely related to the time of detection. 
The local counter 36 is arranged to count pulses pro- 
vided by a voltage controlled oscillator 37. The counter 
36 is arranged to count modulo M which may be, but is 
not necessarily, the same number as its counterpart 30 
counter in the encoder (counter 23). If M differs from N 
the difference may be accommodated in the error equa- 
tion. 

[0022] The voltage controlled oscillator 37 is con- 
trolled by a low pass filtered error signal provided by a 35 
clock controller 39. The error signal is generated in the 
following fashion. Let the SCR arriving at time n be des- 
ignated SCR n and the local count value concurrently 
captured in the latch 35 be designated LCR n . The clock 
controller reads the successive values of SCR's and 40 
LCR's and forms an error signal E proportional to the 
differences 

E=^ISCR n -SCR n . 1 I - ILCR n - LCR n . 1 I 

45 

The error signal E, is utilized to condition the voltage 
controlled oscillator 37 to a frequency tending to equal- 
ize the differences. As indicated previously, negative dif- 
ferences exhibited due to modulo counter wrap around, 
may be ignored. The error signal produced by the clock 50 
controller 39 may be in the form of a pulse width modu- 
lated signal, which may be rendered into an analog 
error signal by implementing the low pass filter 38 in 
analog components. 

[0023] The constraints on this system are that the 55 
counters at the two ends of the system count the same 
frequency or even multiples thereof. This requires that 
the nominal frequency of the voltage controlled oscilla- 



tor be fairly close to the frequency of the system clock at 
the encoder. 

[0024] The foregoing approach provides rather 
rapid synchronization but may introduce a long term 
error. Long term error LTE is proportional to the differ- 
ence 

LTE => ILCR n - LCR 0 I - ISCR n - SCR 0 I 

where SCR 0 and LCR 0 are for example the first occur- 
ring SCR and the corresponding latched value of the 
receiver counter. Nominally the error signals E and LTE 
will vary in discrete steps. As such, once the system is 
"synchronized" the error signal will dither one unit about 
the null point. The preferred method of synchronization 
is to initiate control of the voltage controlled oscillator 
using the error signal E until a one unit dither occurs in 
the error signal E, and then to switch to the use of the 
long term error signal LTE to control the voltage control- 
led oscillator. 

[0025] The system clock signal provided by the 
VCXO 37 may be utilized to operate at least the trans- 
port processor and rate buffers. Since it is synchronized 
at least in frequency with the encoder system clock, the 
possibility of rate buffer overflow or underflow due to 
clock timing errors is substantially non-existent. 
[0026] Refer again to FIGURE 2 for an explanation 
of audio/ video synchronization. Recall that a presenta- 
tion time stamp PTS vidi is included in the compressed 
video signal associated with predetermined video data. 
The PTS vid is indicative of the relative time that the 
associated video is to be displayed. Similarly the com- 
pressed audio signal includes presentation time stamps 
PTS aud associated with audio to be reproduced at times 
related to the respective PTS aud . At the receiver the 
PTS aud and PTS v j d may not be compared directly to 
provide A/V synchronization because the respective 
samples were determined at different instants. The 
respective PTS values are compared to a continuous 
time base which is the receiver clock provided by the 
VCXO 37. This is done by sampling local count values 
LCR generated by the system clock 208 to capture local 
time stamps. 

[0027] When data associated with a corresponding 
PTS is presented, the LCR is sampled. For example, 
the audio decompressor 212 issues a PTS aud when a 
respective audio frame is output for reproduction. At 
these times a control signal conditions the latch 220 to 
sample the LCR, the values of which will be designated 
LAS, for local audio stamp. Similarly when the video 
decompressor provides a video frame for display, it pro- 
vides a PTS vid and a control pulse to condition a latch 
222 to store the current value of the LCR. These LCR 
values are designated LVS for local video stamps. 
[0028] The LAS and the corresponding PTS aud are 
coupled to respective input terminals of a subtracter 21 8 
which develops the signal A A _ PTS according to the rela- 
tion; 
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A A-PTS = PTS aud- LAS 

The LVS and the corresponding PTSvid are coupled to 
respective input terminals of the subtracter 217 which 
develops the signal A v _pts according to the relation; 5 

A v . PTS = PTS vid -LVS. 

The signals A V . PTS and A A _ PTS are coupled to respective 
input terminals of a further subtracter, 21 9, which devel- w 
ops an A/V synchronization error signal ERR PTS 
according to the relation; 

ERR PTS = A V _ PTS - A A . PTS 

15 

Synchronization of the audio and video requires that the 
A/V synchronization error be driven to zero. Thus, when 
the difference in the values of the corresponding audio 
and video PTS's equals the time, in units of the local ref- 
erence, between the occurrence of the corresponding 20 
PTS's, the audio and video signal will be in synchroniza- 
tion. 

[0029] Remember, however, if the synchronization 
error exceeds a particular value dependent upon the 
available audio decoder memory the audio and video 25 
components cannot be synchronized and to attempt to 
do so will result in lost audio data or the production of 
undesirable reproduced audio artifacts. Thus for error 
values greater than a particular threshold, A/V synchro- 
nization should be suspended. 30 
[0030] Two mechanisms may be used to adjust the 
A/V synchronization based upon the error signal 
ERR PTS ; skips and repeats of data sections and con- 
version clock deviation. Skipping fixed intervals or 
"frames" of audio advances the audio data stream by a 35 
fixed interval relative to the video signal. Repeating (or 
muting without consuming data) delays the audio data 
stream by fixed intervals relative to the video signal. 
Skipping and repeating audio frames is audible under 
many conditions, and therefore is only utilized for coarse 40 
adjustment of synchronization. Even so, brief skipping 
or repeating may be preferable to discernible 
audio/video synchronization errors. If the audio frames 
are less than 40 msec, coarse adjustment by skip- 
ping/repeating may result in synchronization errors 45 
within ±20 msec, which is within industry standards for 
A/V synchronization. However, this synchronization will 
degrade if the audio conversion time base does not 
match that of the source. Once synchronization is 
coarsely adjusted, variations of the audio conversion so 
clock frequency are arranged to further refine the A/V 
synchronization. 

[0031] The error signal ERR PTS is applied to a filter 
and processing element 216. A filter function therein 
smoothes the signal ERR PTS to minimize aberrant 55 
effects that might otherwise be generated by signal 
noise. The processing portion of element 21 6 examines 
the smoothed error signal and determines whether a 



skip/repeat of audio should be utilized to effect coarse 
synchronization of the audio and video signals and/or 
whether an adjustment to the audio processing fre- 
quency should be utilized to effect fine synchronization, 
or whether neither should be performed. If a coarse syn- 
chronization adjustment is determined to be necessary, 
the processor 21 6 provides a control signal (SIR) to the 
audio decompressor 21 2 to condition the decompressor 
to skip or repeat the current decompressed audio 
frame. Alternatively, or in addition to the coarse adjust- 
ment, if a fine adjustment is determined to be neces- 
sary, the processor 21 6 provides a control signal to the 
audio time base 21 5 to adjust the frequency of the audio 
processing clock signal. 

[0032] The processing algorithm is detailed in the 
flow chart of FIGURE 4. After the initialization step 400 
of the system, which is designated START, the system 
monitors (401) the audio decompressor for the occur- 
rence of a PTS aud and if a PTS aud is detected it is read 
at step 403 and a local clock reference LAS is captured 
and stored. If a PTS aud has not occurred, the system 
monitors the video compressor for a PTS vid (402). If a 
PTS vid has occurred, the PTS vid is read and a local 
clock reference LVS is captured and stored. When both 
a PTS aud and a PTS vid have been read the ERR PTS is 
calculated at step 405 according to the equation; 

ERR PTS = = A V . PTS - A A _ PTS 

[0033] The magnitude of the error signal is exam- 
ined (420) to determine if it is greater than a predeter- 
mined maximum value dependent upon the size of 
delay storage memory in the audio decoder. If the error 
is greater than this maximum, a check is made at step 
421 to determine how long this condition has existed. If 
it is greater than a predetermined, or selectable, time of 
N seconds, then the synchronization process is sus- 
pended (422). If the condition has existed for less than 
N seconds, the synchronization continues at step 406. 
Because the skip and repeat functions react differently 
with respect to audio memory, the threshold of N sec- 
onds may be different for positive and negative values of 
the error signal. 

[0034] The magnitude of the error signal is exam- 
ined at step 406 to determine if it is greater than, for 
example, one half an audio frame interval. If it is greater 
than one half an audio frame interval, the error signal is 
checked for polarity at step 407. If the polarity is posi- 
tive, the current audio frame is repeated (409). If it is 
negative the current audio frame is skipped at step 408. 
After skipping or repeating a frame the system iterates 
back to the start position to wait for the next occurrence 
of PTS's. 

[0035] At step 406, if the magnitude of the error sig- 
nal is less than one half an audio frame interval, the 
error is examined at step 41 0 to determine if it is greater 
than zero. If the error is greater than zero, the error is 
checked at step 412 to determine if it is less than the 
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previous error signal. If it is less than the previous error 
signal, this is an indication that the system is converging 
toward synchronization, and the synchronization control 
parameters are not changed. The system returns to the 
start position to wait for the next PTS's. Conversely, if 5 
the error has increased over the previous error signal, 
the audio system processing clock is adjusted at step 
414 to lessen its frequency. 

[0036] At step 410, if the error is less than zero 
(negative), it is checked at step 411 to determine w 
whether it is greater than the previous error signal. If it is 
greater than the previous error signal, this is also an 
indication that the system is converging toward synchro- 
nization, and the synchronization control parameters 
are not changed. Alternatively if the current error signal 15 
is less than the previous error signal the system is mov- 
ing further out of synchronization and the audio 
processing clock frequency is increased at step 413. 
After processing steps 412 and 413 the system returns 
to wait for the next occurrence of PTS's. It will be noted 20 
in this example, that the system performs only coarse 
adjustments via skipping or repeating audio frames until 
the A/V synchronization error is reduced to less than 
one half an audio frame interval. 

[0037] In an alternative embodiment, the filtered 25 
error signal is compared against a predetermined 
threshold related to the size of respective audio frames. 
If the error signal is less than the threshold, indicative 
that the audio-video timing error is less than an audio 
frame, the error signal is coupled to the audio time base 30 
circuit 215, wherein it is utilized to adjust the frequency 
of the audio signal processing (decompression) clock. 
Alternatively, if the error signal is greater than the 
threshold, the error signal may be divided by the audio 
frame interval to determine the number of audio frames 35 
that the audio and video signal are misaligned. The inte- 
ger portion of the quotient is applied to the audio 
decompressor to condition the audio decompressor to 
skip or repeat that number of audio frames. The polarity 
of the error signal will determine whether audio frames 40 
should be skipped or repeated. Nominally the com- 
pressed data is arranged in a buffer memory prior to 
being decoded, hence skipping or repeating audio 
frames is a simple matter of controllably enabling the 
memory read/write commands. 45 
[0038] The fractional portion of the quotient is cou- 
pled to the audio time base circuit 215, where it is uti- 
lized to adjust the audio processing clock to fine tune 
the A/V synchronization. 

[0039] The rate of production of audio PTS's is pro- 50 
portional to the processing speed of the audio decom- 
pressor. The processing speed of the audio 
decompressor is directly proportional to the frequency 
of the clock signal used to operate the audio decom- 
pressor. If the clock frequency of the audio decompres- 55 
sor is independent of the clock used to operate the 
video decompressor, and is finely adjustable, then the 
relative rate of occurrence of the audio and video PTS's 



can be adjusted and the A/V finely synchronized. 
[0040] Decompressed audio signal is coupled to a 
digital-to-analog converter (DAC) 227. Analog output 
signal from the DAC 227 is coupled to further analog 
processing circuitry (not shown) via a resistor 228. The 
conduction path of a muting transistor 229 is coupled 
between the resistor 228 and ground potential. The con- 
trol electrode of the transistor is coupled to an output 
connection of a threshold detector 225. A positive con- 
trol voltage of greater than one V be will condition the 
transistor 229 to clamp the audio output signal from 
DAC 227 to ground potential, thereby muting the audio 
signal. 

[0041] Typically a compressed audio signal will 
include a plurality of components such as a left signal, a 
right signal, etc. FIGURE 2 shows only one audio output 
for simplicity, however each audio channel will include a 
muting-circuit controlled by a common muting control 
signal. 

[0042] Muting the audio signal may be performed 
for various reasons, not the least of which is loss of lip- 
sync. The current apparatus is unique in that it mutes 
audio based on lip-sync error. 

[0043] A human viewer will notice lip-sync errors of 
-20 ms or + 40 ms. In the illustrated exemplary system, 
audio is muted if the lip-sync error exceeds approxi- 
mately 13 ms. The 13 ms threshold was chosen to be 
less than 20 ms but greater than one-half an MPEG1, 
layer II audio frame (which is 24 ms). The threshold was 
selected to be slightly greater than one half the audio 
frame because with A/V synchronizing frame skips and 
repeats, an initial synchronizing state at a half frame (12 
ms) is possible, and a threshold of approximately 12 ms 
or less could result in synchronized, but muted audio. In 
addition, a threshold equivalent to one half frame pro- 
duces intermittent muting due to minor fluctuation in cal- 
culated lip-sync errors due to clock and PTS strobe 
sampling uncertainties. 

[0044] The muting control signal is generated by 
the threshold detector 225, which monitors the A/V syn- 
chronization error signal available from the subtracter 
219. When the error signal exceeds a value correspond- 
ing to 13 ms, a muting control value is generated. To 
preclude noise or other impulsive conditions from caus- 
ing generation of false muting signals, the error signals 
from the subtracter 219 may be low pass filtered before 
application to the threshold detector 225. 
[0045] Dashed arrows emanating from the detector 
225 and terminating on the DAC 227 and the audio 
decompressor 212 indicate alternative muting possibili- 
ties. For example, the muting control signal may be 
designed for disabling the output of the DAC 227 or the 
output of the decompressor 212. In either instance the 
disabling function should be arranged to condition the 
respective processing element to output a signal ampli- 
tude value midway between the output signal dynamic 
range. The FIGURE 2 arrangement also indicates that 
video muting (or blanking) may be implemented via con- 
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trol of the video D AC 224. 

[0046] Muting control may be suspended when syn- 
chronization is too far from convergence. Detector 225 
may be configured to compare the error signal with a 
maximum threshold, and suspend muting if the error 5 
signal exceeds this threshold, or if it exceeds the maxi- 
mum threshold for a predetermined time, or simply if 
muting occurs for longer than a predetermined interval. 
A muting suspension control may also be provided by 
the processor 21 6 at step 422. 

[0047] FIGURE 5 illustrates an alternative muting 
arrangement. In this embodiment the skip/repeat (S/R) 
signal generated by the filter 216 is used as a primary 
muting control. This signal is coupled to the muting cir- 
cuitry via an OR gate 230 and an AND gate 233. The 
AND gate 233 is used as a mute override responsive to 
a SUSPEND MUTE signal from the processor 216. 
AND gate 233 is really symbolic in nature, because sus- 
pension of muting will likely be effected within the mute 
control signal generator itself. 

[0048] If in a particular system the fine control used 
to adjust the audio time base spans a range encom- 
passing at least one half an audio frame, a threshold 
detector, 231 , may be included to monitor the audio time 
base fine control signal and generate a secondary mut- 
ing control signal. This secondary control signal is also 
coupled to the muting circuitry via the OR gate 230. 
[0049] The apparatus of FIGURE 5 illustrates a fur- 
ther possible muting circuit, consisting of an AND gate, 
226, to selectively couple the decompressed audio sig- 
nal from the decompressor 212 to the DAC 227. Nomi- 
nally audio signals are bipolar (AC) and swing about 
zero value. The AND gate when in the decoupled state, 
desirably outputs zero value which is the middle of the 
signal dynamic range. 

[0050] Currently, compressed A/V direct broadcast 
satellite systems are evolving in which a plurality of 
packeted programs are time division multiplexed and 
conveyed via a single transponder. A particular program 
may include only an audio signal, obviating lip-sync 
problems. However, undesirable audio may be repro- 
duced if the system clock has not synchronized. There- 
fore, a further threshold detector, 232, may be included 
to monitor the error signal generated by the clock con- 
troller 39 shown in FIGURE 3. The threshold detector, 
232, generates a muting control signal when the error 
signal produced by the controller represents a fre- 
quency deviation from the locked condition by, for exam- 
ple, 0.2. This muting control signal is coupled to the OR 
gate 230 to effect audio muting until the system clock is 
substantially synchronized with the corresponding 
encoding system clock. Similarly, detection apparatus 
may be coupled to measure the frequency deviation of 
the audio time base 215 for generating a further muting 
signal which may be ORed in the gate 230. 



Claims 

1. In a receiver for processing compressed 
audio/video signal, apparatus characterized by: 

a detector (200, 202, 206) for providing said 
compressed audio/video signal; 
an audio decompressor (212), responsive to 
said compressed audio/video signal, for provid- 
ing decompressed audio signal; 
a video decompressor (214), responsive to 
said compressed audio/video signal, for provid- 
ing decompressed video signal; 
an error detector (21 7-222), responsive to time 
stamps provided with decompressed audio sig- 
nal and decompressed video signal, for gener- 
ating an error signal which is a measure of non- 
synchronization of associated decompressed 
audio and decompressed video signals; 
a processor (216), responsive to a first range of 
values of said error signal for controlling said 
audio decompressor to skip or repeat seg- 
ments of decompressed audio signal for syn- 
chronizing associated decompressed audio 
and video components, and responsive to error 
signals greater than said first range for prevent- 
ing synchronization of associated decom- 
pressed audio and video components. 

2. The apparatus set forth in claim 1 characterized in 
that said processor includes: 

timing circuitry for timing the duration of said 
error signal being greater than said first range; 
and 

wherein said processor prevents synchroniza- 
tion of associated decompressed audio and 
video components only when the duration of 
said error signal being greater than said first 
range exceeds a predetermined time period. 

3. The apparatus set forth in claim 1 or 2 further char- 
acterized by muting circuitry, responsive to said 
error signals for muting decompressed audio signal 
for a given range of values of said error signal. 

4. The apparatus set forth in claim 3 characterized in 
that said muting means comprises: 

a threshold detector, responsive to said error 
signal, for generating a control signal having 
first and second states when said error signal is 
greater and lesser than said predetermined 
threshold respectively; and 
signal clamping means, responsive to said 
control signal, for passing said decompressed 
audio signal when said control signal exhibits 
said second state and providing a substitute 
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value for said decompressed audio signal 
when said control signal exhibits said first state. 

5. The apparatus set forth in claim 4 further character- 
ized by: 5 

synchronization circuitry, responsive to said 
compressed audio/video signal, for generating 
a system clock signal having a predetermined 
relation to an encoder system clock signal used w 
in generation of said compressed audio/video 
signal, and including a detector for providing a 
further error signal which is a measure of non- 
synchronization of said system clock signal; 
a further threshold detector, responsive to said 15 
further error signal, for generating a control sig- 
nal having first and second states when said 
further error signal is greater and lesser than a 
further predetermined threshold respectively; 
and 20 
ORing circuitry for combining said control sig- 
nals from said threshold and further threshold 
detectors. 

6. The apparatus set forth in any of claims 1 to 5 char- 25 
acterized in that said error detector is a lip-sync 
detector. 

7. The apparatus set forth in any of claims 1 to 6 char- 
acterized in that audio and video components of 30 
said audio/video signal include respective time 
stamps PTS aud and PTS vid determined at predeter- 
mined times and related to an encoder system 
clock, and said audio and video decompressors 
provide said time stamps PTS aucj and PTS vid with 35 
associated decompressed audio and video signals, 
and said error detector includes: 



8. The apparatus set forth in any of claims 1 to 6 char- so 
acterized in that audio and video components of 
said audio/video signal include respective time 
stamps PTS aud and PTS vid determined at predeter- 
mined times and related to an encoder system 
clock, and said audio and video decompressors 55 
provide said time stamps PTS aud and PTS vid with 
associated decompressed audio and video signals, 
and said detection means generates said error sig- 



nal as a function of the difference between the val- 
ues of said PTS aud and PTS vid for associated 
decompressed audio and video. 

9. A method of synchronizing audio and video compo- 
nents in a compressed audio/video receiver char- 
acterized by: 

determining values indicative of timing of non 
synchronization of decompressed audio and 
video signals; 

effecting synchronization of said decom- 
pressed audio and video signals if said values 
are less than a predetermined time and provid- 
ing synchronized or partially synchronized 
decompressed audio and video output signals; 
and 

preventing/suspending synchronization proc- 
esses of said decompressed audio and video 
signals if said values are greater than said pre- 
determined time and providing non-synchro- 
nized decompressed audio and video output 
signals. 

10. The method of synchronizing audio and video com- 
ponents set forth in claim 9 further characterized by 
the steps of: determining the duration that said val- 
ues are greater than said predetermined time; and 

suspending synchronization of said decom- 
pressed audio and video signals only if said 
duration is greater than a predetermined dura- 
tion. 

11. The method of synchronizing audio and video com- 
ponents set forth in claim 9 or 1 0 further character- 
ized by the steps of: 

comparing the values indicative of timing to a 
range of values; 

muting said audio signal if said values are 
greater than a further predetermined time and 
said system is effecting synchronization. 
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a source of a local clock signal; 

means for determining, in cycles of said local 40 

clock signal, the time T between occurrences 

of corresponding time stamps PTS aud and 

PTS vid; 

means for calculating the difference between 
values of corresponding time stamps PTS aud 45 
and PTS vid and comparing such difference with 
said time T to generate an A/V synchronization 
error signal. 
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